High performance logistic regression for privacy-preserving genome analysis

Abstract

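Background: In biomedical applications, valuable data is often split between owners who cannot openly share the data because of privacy regulations and concerns. Training machine learning models on the joint data without violating privacy is a major technology challenge that can be addressed by combining techniques from machine learning and cryptography. When collaboratively training machine learning models with the cryptographic technique named secure multi-party computation, the price paid for keeping the data of the owners private is an increase in computational cost and runtime. A careful choice of machine learning techniques, together with algorithmic and implementation optimizations, is a necessity to enable practical secure machine learning over distributed data sets. Such optimizations can be tailored to the kind of data and machine learning problem at hand.

Methods: Our setup involves secure two-party computation protocols, along with a trusted initializer that distributes correlated randomness to the two computing parties. We use a gradient descent based algorithm for training a logistic regression like model with a clipped ReLU activation function, and we break down the algorithm into corresponding cryptographic protocols. Our main contributions are a new protocol for computing the activation function that requires neither secure comparison protocols nor Yao's garbled circuits, and a series of cryptographic engineering optimizations to improve the performance.

Results: For our largest gene expression data set, we train a model that requires over 7 billion secure multiplications; the training completes in about 26.90 s in a local area network. The implementation in this work is a further optimized version of the implementation with which we won first place in Track 4 of the iDASH 2019 secure genome analysis competition.

Conclusions: In this paper, we present a secure logistic regression training protocol and its implementation, with a new subprotocol to securely compute the activation function. To the best of our knowledge, we present the fastest existing secure multi-party computation implementation for training logistic regression models on high dimensional genome data distributed across ...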
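The secure multiplications counted in the abstract can be illustrated with a standard textbook construction: two parties hold additive shares of their inputs, and a trusted initializer hands out shares of a precomputed multiplication triple as correlated randomness. The sketch below is a single-process simulation under stated assumptions, not the authors' implementation. The ring modulus `MOD` and the helper names `share`, `reconstruct`, `trusted_initializer_triple`, and `secure_multiply` are hypothetical and chosen for illustration only.

```python
import secrets

# Assumption for this sketch: arithmetic in the ring of integers modulo 2**64.
MOD = 2 ** 64


def share(x):
    """Split x into two additive shares that sum to x modulo MOD."""
    s0 = secrets.randbelow(MOD)
    return s0, (x - s0) % MOD


def reconstruct(s0, s1):
    """Recombine two additive shares into the value they hide."""
    return (s0 + s1) % MOD


def trusted_initializer_triple():
    """Simulate the trusted initializer: sample a and b, set c = a*b,
    and return additive shares of all three values."""
    a = secrets.randbelow(MOD)
    b = secrets.randbelow(MOD)
    c = (a * b) % MOD
    return share(a), share(b), share(c)


def secure_multiply(x_sh, y_sh, triple):
    """Multiply two shared values with one multiplication triple.

    Both parties are simulated here; in a real deployment each party
    would hold only the share with its own index.
    """
    (a0, a1), (b0, b1), (c0, c1) = triple
    x0, x1 = x_sh
    y0, y1 = y_sh
    # Each party masks its shares locally; d = x - a and e = y - b are
    # then opened. They reveal nothing because a and b are uniformly random.
    d = ((x0 - a0) + (x1 - a1)) % MOD
    e = ((y0 - b0) + (y1 - b1)) % MOD
    # x*y = c + d*b + e*a + d*e; only party 0 adds the public term d*e.
    z0 = (c0 + d * b0 + e * a0 + d * e) % MOD
    z1 = (c1 + d * b1 + e * a1) % MOD
    return z0, z1


product_shares = secure_multiply(share(6), share(7), trusted_initializer_triple())
print(reconstruct(*product_shares))
```

Each multiplication consumes one fresh triple, so a training run with billions of secure multiplications needs as many triples from the initializer. Real-valued model weights would additionally need a fixed-point encoding, which this sketch leaves out.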

Similar resources

Privacy-Preserving Logistic Regression

Logistic regression is an important statistical analysis method widely used in research fields including health, business and government. On the other hand, preserving data privacy is a crucial aspect of every information system. Many privacy-preserving protocols have been proposed for different statistical techniques, with various data distributions, owners and users. In this paper, we propos...

Privacy-preserving logistic regression

This paper addresses the important tradeoff between privacy and learnability, when designing algorithms for learning from private databases. We focus on privacy-preserving logistic regression. First we apply an idea of Dwork et al. [6] to design a privacy-preserving logistic regression algorithm. This involves bounding the sensitivity of regularized logistic regression, and perturbing the learn...

Privacy Preserving Regression Residual Analysis

Regression analysis is one of the most basic statistical tools for generating predictive models that describe the relationship between variables. Once a model has been generated, numerous goodness-of-fit measures are used to evaluate the degree to which the model characterizes the relationship between the variables under consideration. The analysis of regression residuals is one such measure, w...

PrivLogit: Efficient Privacy-preserving Logistic Regression by Tailoring Numerical Optimizers

Safeguarding privacy in machine learning is highly desirable, especially in collaborative studies across many organizations. Privacy-preserving distributed machine learning (based on cryptography) is popular to solve the problem. However, existing cryptographic protocols still incur excess computational overhead. Here, we make a novel observation that this is partially due to naive adoption of ...

Privacy-Preserving Regression Algorithms

Regression is arguably the most applied data analysis method. Today there are many scenarios where data for attributes that correspond to predictor variables and the response variable itself are distributed among several parties that do not trust each other. Privacy-preserving data mining has grown rapidly studying the scenarios where data is vertically partitioned. While algorithms have been d...

Journal

Journal title: BMC Medical Genomics

Year: 2021

ISSN: 1755-8794

DOI: https://doi.org/10.1186/s12920-020-00869-9